Experimental realization of optimal detection strategies for overcomplete states 
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We present the results of generalized measurements of optical polarization designed to provide 
one of three or four distinct outcomes. This has allowed us to discriminate between nonorthogonal 
polarization states with an error probability that is close to the minimum allowed by quantum theory. 
Employing these optimal detection strategies on sets of three (trine) or four (tetrad) equiprobable 
symmetric nonorthogonal polarization states, we obtain a mutual information that exceeds the 
maximum value attainable using conventional (von Neumann) polarization measurements. 
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I. INTRODUCTION 

The basic building block in quantum communications 
and information is the qubit, that is a physical system 
with two orthogonal quantum states. A simple example 
is the horizontal and vertical states of polarization asso- 
ciated with a single photon. It is also possible to prepare 
any superposition of vertical and horizontal polarization 
and these correspond to other states of linear, circular 
and elliptical polarization. In quantum communications 
a transmitting party (Alice) might select from a num- 
ber of possible nonorthogonal polarization states to send 
to the receiving party (Bob). This idea is the basis of 
the emerging technology of quantum key distribution [Q . 
Signal states with nonorthogonal polarizations may also 
arise as a result of noisy communications channels acting 
on orthogonal signal states. For certain types of noisy 
channels, the throughput of information is actually max- 
imized for nonorthogonal signal states Q . 

The message encoded in the transmitted photons must 
be retrieved by measurement. This can be achieved ei- 
ther by a conventional (von Neumann) measurement or 
by means of a generalized measurement ^, ^ ||. The 
von Neumann measurement gives answers corresponding 
to one of a pair of orthogonal polarization states. The 
generalized measurements presented in this paper give 
one of three or four possible outcomes corresponding to 
the elements of a probability operator measure or POM 

, also called a positive operator- valued measure Q . 

If there are only two possible states then it is known 
that a single von Neumann measurement with two pos- 
sible outcomes will minimize the probability for error in 
identifying the state ^. State discrimination with 
this minimum allowed error probability has been demon- 
strated for weak pulses of polarized light |Q . With only 
two possible states, it is also possible to discriminate 



between the two possibilities without error if we allow 
for the possibility of an inconclusive measurement out- 
come 1^ ^, |ll|, Near error-free discrimination 
between two nonorthogonal polarization states was first 
demonstrated by Huttner et al. More recently, we 
have shown that it is possible to discriminate between 
nonorthogonal polarization states iQ with the probabil- 
ity for inconclusive outcomes at the quantum limit de- 
termined by Ivanovic, Dieks and Peres. 

Unambiguous discrimination between two nonorthog- 
onal states of a qubit is an example of a generalized mea- 
surement 1^, Q in that it requires three possible mea- 
surement outcomes but there are only two orthogonal 
polarizations. Measurements of this kind are required if 
we wish to perform an optimum measurement on a qubit 
prepared in more than two different states. In this paper 
we will describe experiments on polarized light prepared 
in one of three or four nonorthogonal states. The three 
states form a so-called trine ensemble ||, |l5[ |l^ , and take 
the form of three states of linear polarization separated 
by an angle of 60° . These states may be represented on 
the Poincare sphere [O as three equidistant points on 
the equator (see Fig.Tl left). The four states form a 
tetrad ensemble |lq] and take the form of two linearly 
and two elliptically polarized states. These states form 
the vertices of a tetrahedron on the Poincare sphere (see 
Fig. 0, right). 

In this paper we describe our experimental realizations 
of two types of optimal measurements on these trine and 
tetrad states. The first gives the minimum probability 
of error in identifying the state 19, |2^. The sec- 
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ond provides the knowledge that the qubit was definitely 
not prepared in one out of the three or four possibilities, 
but does not discriminate further between the remain- 
ing possibilities |2l|] . This measurement is known to give 
the accessible information (i.e., the mutual information 
maximized with respect to detection strategy) between 
transmitter and receiver for equal prior probabilities for 
the trine states We calculate the mutual informa- 

tion attainable from our experiments and show that it 
exceeds that which can be obtained by the best von Neu- 
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FIG. 1: The trine (left) and tetrad (right) states shown as po- 
larization directions (top), and on the Poincare sphere (shown 
from two different orientations, middle and lower). Anti-trine 
(orthogonal) states are also shown (dashed) in the top and 
middle left diagrams and antitetrad polarization states in the 
top right diagram. 



mann measurement. 

The rest of the paper is organized as follows. In Section 
U the relevant theory of probability operator measures 



(POMs) is outlined. In Section [II our implementation of 
such POMs in optical networks is described theoretically, 
followed in Section IV by presentation of the experimen- 
tal setup and results. 



II. THEORY 

Consider a communication channel in which photons 
are prepared, with equal prior probabilities, in one of 
three or four known states of polarization. Our task is to 
perform a measurement on the light and so retrieve in- 
formation about the prepared state. We are interested in 
two types of measurement. The first is designed to mini- 
mize the probability of error in identifying the state cor- 



rectly, and the second is aimed at maximizing the mutual 
information associated with the communications channel. 

Our signal states are realizations of either the trine 
1^, |6| or tetrad jl^ ensembles. The polarization 
states making up our trine ensemble are 
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l{\h) + Vs\v) 

-l{\h)-V3\v) 



\h), 



(1) 



corresponding to states of linear polarization separated 
by 60° (see Fig. 1, left). The kets \h) and \v) are states 
of horizontal and vertical polarization respectively. The 
superscript label denotes the fact that there are three 
members of the trine ensemble. The states making up 
our tetrad ensemble are 



l^2> = 



1^3) 



_L(_|/,) + y2e-^W3|,))^ 
\h), 



(2) 



where the superscript label again denotes the number of 
states in the ensemble. The first two states correspond 
to elliptical polarizations and the second two represent 
linear polarizations. We also require the states making 
up the antitrine and antitetrad ensembles. These are the 
polarization states that are orthogonal to the trine and 
tetrad states. We denote the members of the antitrine 
and antitetrad ensembles by an overbar. The members 
of the antitrine ensemble are 



I (V3\h) 



(3) 



and those of the antitetrad ensemble are 

1^^) = (V2e2-/3|/.) + |.)) , 

= -l=(^V2e-'-^/'\h) + \v)), 
1 



m = \v)- 



(4) 



The optimum detection strategies require generalized 
measurements which may be described in terms of the 
elements of a POM Each of the possible results (yj) 
of the measurement corresponds to a Hermitian opera- 
tor, Ilj, such that the probability of obtaining this result 
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given that the polarization was prepared in the state [ip 
is 



(5) 



where N is the number of states making up the ensemble 
(three for the trine and four for the tetrad). 

The requirement that the expectation value of Ilj is a 
probability leads to the constraints that all the eigenval- 
ues of IIj are positive or zero and that 



En. 



(6) 



where the sum runs over all possible results of the mea- 
surement. This description of a measurement is more 
general than that associated with a conventional (von 
Neumann) measurement. For a von Neumann measure- 
ment the POM elements are projectors onto the or- 
thonormal eigenstates of the measured observable. In 
general, however, the POM elements will be neither nor- 
malized nor orthogonal. 



A. Minimum error probability 

The problem of finding the measurement which pro- 
vides the minimum error probability is an old one and it 
is instructive to present it in a more general form than is 
strictly necessary for the description of our experiments. 
Consider a set of possible prepared states with density 
matrices pk and let these occur with prior probabilities 
Pk- The lowest error probability will be achieved by a 
measurement in which we associate a POM element 11^ 
with the statement that the prepared state was pk- The 
error probability is then 



-Pe = 1 - y^PfcTr(/9fcnfc), 



(7) 



where the sum gives the probability that the state will be 
correctly identified. The necessary and sufficient condi- 
tions on the POM element for this probability to attain 
its minimum value EOf are 



^jiPjP] - Pfc/Ofc)nfe = V (j, fc). 



J- 



(8) 
(9) 



The general solution for an arbitrary set of states and 
prior probabilities is unknown but the solution is known 
for some special cases. We are interested in the pure 
states making up the trine and tetrad ensembles. These 
states are overcomplete in that we can resolve the identity 
as a weighted sum of the corresponding projectors 



N 



N 



^|^f)(V'f| = i. 



(10) 



If we impose the condition that the states occur with 
equal prior probabilities then the POM elements that 
minimize the error probability Eq. (|^) are |^, ^ 



(11) 



This gives for the minimum error probability the value 
PI 



Pe (min) 
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This is I for the trine ensemble and ^ for the tetrad. 



(12) 



The 



remaining problem is to determine how we can design 
experiments for which the POM elements are those given 
in Eq. (pi]). Before addressing this problem we consider 
another scenario in quantum signal detection. 



B. Mutual information 

The minimum error probability is used as a measure 
of how precisely one can identify each signal states on 
average. If instead of identifying each individual state 
as precisely as possible, one is interested in extracting as 
much classical information as possible from a sequence of 
quantum states, a measure commonly used is the Shan- 
non mutual information. 

Let us suppose that Alice prepares the ensemble of 
quantum states {IV'fc^)} to encode the classical message 
{xk}, where each element appears according to a priori 
probability pfc . Let us call this ensemble X = {|'0fc'):Pfc}- 
The Shannon entropy of X , 



H{X) = -J2PklogPk, 



(13) 



quantifies how much we know about X just from the 
a priori probabilities pk- Actually, the minimum value 
of H{X) is zero, corresponding to knowing everything 
about X, and the maximum value of H{X) corresponds 
to knowing nothing better than a random guess for each 
element of X, so it is more natural to say that H{X) 
measures what we don't know about X . Bob now applies 
a measurement with M possible outcomes {y^}, charac- 
terized by the POM elements {11^}. The outcome, say, 
gives Bob more information about X. The new prob- 
ability distribution conditioned by yj is 



P{yjW)Pk 
Piyj) 



(14) 



where P{yj) = J2i Piyj\'^i^)Pi is the probability of hav- 
ing yj, and P{yj\ipk) is defined by Eq. (||). One can 
then define the average conditional entropy by 



H{X\Y) = -^P(y,)E^('/^fe^l2^^)l°g^(^^^lyj-) 



fe=i 



(15) 
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This quantifies the remaining uncertainty about X after 
having the knowledge of the conditioning variable Y = 
{Uj, P{yj)}. The information gained as a result of making 
the measurement Y is naturally defined by 



I{X : Y) 



H{X)-H{X\Y), 

M N 

EE 



j=i k=i 



log (^Mi^ 



(16) 



(17) 



This is the Shannon mutual information between X and 
Y. In general, minimizing the average error probabil- 
ity and maximizing the mutual information are different 
problems, and in fact, one often finds the optimal POMs 
are different in each case. To extract as much information 
as possible. Bob has to maximize the mutual information 
with respect to the POM. The maximum value 



/acc = max/(X : Y), 
{n,} ^ 



(18) 



is called the accessible information of the ensemble X. 
To obtain the accessible information, it is not neces- 
sary for the number of measurement outcomes to be the 
same as the number of signal states. (In the experiments 
described in this paper, however, these will always be 
equal.) 

The trine and tetrad ensembles comprise nonorthogo- 
nal states and hence errors in determining each individ- 
ual state are inevitable. It is possible, however, to achieve 
asymptotically error-free transmission of information by 
employing block coding schemes based on sequences of 
the states. The accessible information has a practical 
meaning in this problem. Let {11^} be the POM attain- 
ing the accessible information for X. We consider block 
sequences of {\ip^)} of length n and choose 2*^ such se- 
quences (codewords) to code classical messages. Suppose 
that Bob applies {Ilj} on each states of a received block 
sequence separately, and guess the codeword sent based 
on the outcome with any classical procedure. Then if 
k/n < I Acc, Alice and Bob can communicate with an 
arbitrarily small error by choosing n large enough [ p4| . 

Thus it is important to find the optimal POM and, in 
particular, to demonstrate its physical implementation 
in practice, for developing communication technology 
though a quantum limited channel. The general prob- 
lem of finding the accessible information for any given 
set of signal states remains unsolved. The complete re- 
sults are known for a few special cases jl^, . For 
the trine states with equal prior probabilities the accessi- 
ble information is attained for a measurement with POM 
elements based on the antitrine states 



(19) 



The optimality of this measurement strategy was con- 
jectured in 1 15: ^3 and finally proven by Sasaki et 



al. [p2| . The corresponding accessible information is 
log2 |~ = 0.585 bits. This clearly exceeds the value 
(— i + i log2 3) — 0.459 bits which is the maximum mu- 
tual information attainable for a von Neumann measure- 
ment corresponding to finding one of two orthogonal po- 
larizations. 

The accessible information for the tetrad ensemble is 
not so strongly established as it is for the trine ensemble. 
Davies [|l^ has conjectured that it corresponds to the 
POM elements 



(20) 



associated with the antitetrad states. The mutual infor- 
mation associated with this measurement strategy (con- 
jectured to be the accessible information) is log2 | — 
0.415 bits. This again exceeds the value of |(1 — 
i log2 3) — 0.311 bits attained for the best possible von 
Neumann measurement. 

It should be noted that the measurement strategy 
based on the antitrine and antitetrad states only iden- 
tifies the correct state with probability 1/{N — 1). How- 
ever, it obtains more information than the minimum er- 
ror strategy by identifying for certain one of the states 
that was not sentlSCll. 



III. OPTICAL IMPLEMENTATION 

Having described the results that are predicted for var- 
ious optimal measurements of the trine and tetrad states, 
we now turn to the implementation of such optimal gen- 
eralized measurements using specific optical networks to 
measure photon polarization states. The theoretical op- 
eration of the optical networks will be described in the 



next subsections, followed in Section [V by the descrip- 



tion and results of the actual experiments we performed. 

The generalized measurements described in the previ- 
ous section have to be implemented in practice as von 
Neumann measurements with simple yes/no results, but 
in an enlarged state space ^, |l^, In our exper- 
iments, the state space is enlarged by incorporating an 
interferometer into the measurement apparatus. The en- 
largement of the state space is realized by the introduc- 
tion of vacuum modes entering through the unused input 
port. A single interferometer allows up to four mutu- 
ally exclusive (orthogonal) possible results from a single 
photon input state. 

In order to describe clearly how our apparatus works, 
we first set out our sign conventions for the optical com- 
ponents. The main polarizing beam splitters are oriented 
so that horizontally polarized photons, \h), go straight 
through, while vertically polarized photons, are de- 
flected through an angle of tt/2. For the non-polarizing 
beam splitter, the transmission and reflection coefficients 
are assumed to be equal (both l/-\/2), and since there 
is only one (non-vacuum) input to it, we take there to 
be no phase difference between the two outputs. For 
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FIG. 2: Optical network for the trine generalized measure- 
ments. PBS = polarizing beam splitter, WP — waveplate 
and PD = photodetector (labels correspond with Fig. 
The dotted lines AA, BB and CC are referred to in the text. 
The Jones matrix corresponding to each waveplate is shown 
alongside it (A2(a/2), etc.) with the angle indicating the ori- 
entation of the axes. 



the quarter and half waveplates, their effect when placed 
with their fast axis at an arbitrary angle (measured an- 
ticlockwise from the horizontal viewed in the direction 
of travel of the photons) can be described in terms of 
Jones matrices (see Appendix ^). In particular, a half 
waveplate placed with its axes at an angle of 7r/4 to the 
vertical/horizontal rotates \v) to \h), and \h) to \v), as 
is well known. A half waveplate with axes at n/S to the 
vertical/horizontal and a quarter waveplate with axes at 
an angle of 7r/4 both produce maximum mixing between 
the \v) and \h) states, which is what is required to analyze 
the state obtained when the two arms of the interferome- 
ter are recombined. The quarter waveplate is used in the 
tetrad case, where there is a phase factor of i between 
the components in each arm. A half waveplate placed 
at the special angle a/2 — ^ arcsin(l/V3) ~ 17.63°, is 
used to rotate the photon until the amplitude of the \h) 
component is reduced to y/2/3 of its initial value. 



A. Trine 

An optical network that realizes the minimum error 
probability for the three trine states is depicted in Fig. B. 
This is a variation of recently proposed networks p2| . 
Our trine input states are represented in terms of photon 



After passing through the polarizing beam splitter PBSl, 
the \h) component of each state goes into the upper arm 
of the interferometer and the \ v) component goes into the 
lower arm. The half- waveplate WP5 then rotates the \h) 
component such that at the stage specified by the line 
AA in Fig. ||, the states have been transformed to 

V3 



iM 



\/2|/iu) 
V2\hv) 
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+ — K) 



IV'; 



[V2\l 



(21) 



1 1 

1 1 

1 

where the subscripts U and L denote the upper and lower 
arms of the interferometer respectively. The \hu) part 
of these states passes out of the interferometer through 
PBS2 and into photodetector PD3. If the input state was 
IV'I), there is a probability of | that PD3 will detect the 
photon. If the input state was \tpf) or ji/;!), there is a 
probability of ^ for each state that PD3 will detect the 
photon. Turning this round, if PD3 detects the photon, 
there is a probability of | that the input state was 
and a probability of ^ each that the input state was 
or IV^i). 

Continuing through the interferometer, the half wave- 
plate WPG rotates the component in the lower arm from 
I^l) to so at the stage specified by BB in Fig. ||, 

the states are 
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It is necessary to arrange the path lengths in the arms 
of the interferometer such that there is a relative phase 
of 7r/2 between the U and L components. Then, after 
the polarizing beam splitter PBS4 recombines the two 
components, the half waveplate WP9 mixes the two po- 
larizations such that, at the stage indicated by CC on 
Fig. |[ the states become 
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(23) 



Beam splitter PBS6 separates the \h) and \v) compo- 
nents so they go into detectors PDl and PD2 respec- 
tively. From this we can see that if PDl detects the 
photon, there is a probability of | that the input state 
was and a probability of ^ each that it was j-^f) 

or IV'I), similarly for PD2 with states and jV'l) in- 
terchanged. In other words, after passing through this 
optical network, the final states (denoted by subscript 
F) are given by 
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||P^3) + -L|Pm)--L|P..2). 



(24) 



This is all normalized so the probabilities are given by 
the square of the amplitudes of each photodetector state. 
(The photodetector states \PD1), \PD2) and \PDi) are 
necessarily mutually orthogonal as only one photodetec- 
tor out of the three can detect the photon.) 

In order to implement the POM corresponding to the 
maximum mutual information, instead of constructing 
the optical network that corresponds to POM elements 
and applying it to the trine states, we use 
the optical network just described (corresponding to the 
POM elements I) ^^'^ apply it to the antitrine 

states IV'j)- This is obviously completely equivalent the- 
oretically and more practical experimentally. We find 
that the state should never trigger photodetector 
j. It will, however, lead to detection in either of the re- 
maining two photodetectors with equal probability (5) 



B. Tetrad 

The optical network we used to implements the optimal 
measurement for the tetrad states is shown in Fig. ^. It 
is similar to the trine network in Fig. ^, with an extra 
detector PD4. The key differences are a non-polarizing 
beam splitter (NPBS) in place of PBS2 and WP5 rotated 
to an angle of (7r/2 -I- a)/2. The four tetrad states are 
represented as polarized photons as in Eq. (H), 
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l^{-\h)+V2\v)), 
\h). 



The tetrad network can be understood most straight- 
forwardly by noting the effect of the half waveplate, WP5, 




PD4 PBS4 



input state 



FIG. 3; Optical network for the tetrad generalized measure- 
ment. NPBS = non-polarizing beam splitter, PBS = polariz- 
ing beam splitter, WP = waveplate and PD = photodetector 
(labels correspond with Fig. The dotted lines AA, BB 
and CC are referred to in the text. The Jones matrix corre- 
sponding to each waveplate is shown alongside it (A2(q/2), 
etc.) with the angle indicating the orientation of the axes. 



on each of the four tetrad states. Denoting the operation 
of WPS by the Jones matrix A2(7r/4 + a/2), 

A2(V4 + a/2)|^f) = «|V2>, 
A2(V4 + a/2)|^2'> = -*l^f>, 
A2(V4 + a/2)|^|> 
A2(V4 + a/2)|^l) 



1^1) 
1^1) 



(25) 



so that, apart from phase factors, state \ip\) is converted 
into I'i/'j) and vice versa, similarly for states IV'I) and 
I'f/'l). Remembering the first beam splitter is 50/50 non- 
polarizing, the network is thus essentially symmetrical 
in its operation. The full details are given in Appendix 
p. The final result is that state IV'I) reaches photode- 
tector PDl with probability ^ and each of the remaining 
photodetectors with probability i . Similarly each of the 
remaining three states will trigger its associated photode- 
tector with probability ^ and the others with probability 
1 

6- _ 

The maximum mutual information measurement is re- 
alized in the same way as for the trine ensemble, by using 
the antitetrad states as input. If the antitetrad states 
are introduced into this network then we find that the 
state l-i/J^) should never trigger photodetector j. It will, 
however, lead to detection in any of the remaining three 
photodetectors with equal probability (i). 
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FIG. 4: Experimental setup for trine and tetrad state mea- 
surements. See text for full description. L = lens, ND = 
neutral density filter to attenuate the light, NPBS — non- 
polarizing beamsplitter, PBS — polarizing beam splitter, PD 
= photodetector, TS = Titanium-Sapphire laser, WP = wave- 
plate, either half or quarter. 



IV. EXPERIMENTS 

Figure]^ shows the experimental arrangements used for 
the trine and tetrad experiments. The apparatus was eas- 
ily converted between the two experiments by changing 
WP7 from a half to a quarter waveplate and exchanging 
PBS2 with NPBS. We first discuss the general arrange- 
ment and techniques that are common to both experi- 
ments (see also Ref. |0, where the same experimental 
setup is described in more detail) . The individual details 
and results of the two experiments arc then described 
separately in Sections IV A and |VB, 



The light source was a mode-locked light Ti:Sapphire 
laser operating at 780 nm with a repetition rate of 80.3 
MHz. The pulse duration of 1 ps corresponds to a pulse 
length of 300 /im. The repetition rate of the laser en- 
sured that there was only one pulse in the optical system 
at any one time. The length of each pulse was much 
shorter than the path length of the interferometer. This 
meant that each pulse could only interact with one opti- 
cal component at a time. 



The laser light was passed through a 60 /im pinhole to 
produce a spherical wavefront. Lens L2 then focused the 
light onto the photodiodes PDl-5 (Centronix, BPX65) 
through the interferometer. This arrangement minimized 
the phase distortions across the wavefront as it passed 
through the non-ideal interfaces of the beamsplitters, 
maximizing the visibility of the interference. The focus 
on the detectors was much smaller than the 1 mm^ de- 
tector area, yielding almost 100% spatial collection of the 
light. Losses due to the anti-reflective coatings on the 
beamsplitters were also small, so almost all the photons 
entering the interferometer reached the detectors. 

The Rochon polarizing beamsplitter PBSl was used to 
polarize the light with a linearity of better than 5000:1. 
The WoUaston polarizing beamsplitter PBS6 was chosen 
for its similar efScient polarizing properties. The wave- 
plates WP2-4 were then used to prepare the input states, 
see Appendix 

In these experiments, we measure the average current 
from the photodiodes rather than counting discrete pho- 
ton events. To detect the small quantities of light, phase 
sensitive detection |p7|| was employed using a chopper 
wheel, differential amplifier and a lock-in amplifier. The 
differential amplifier was used to reduce the noise of the 
signal by canceling out common ground loop noise us- 
ing a darkened detector. The detectors PDl-5 had a 
nominal quantum efficiency of 83% at 780 nm and were 
terminated by 10 Mil. At an average laser intensity of 
0.1 photons per pulse this corresponds to a detected volt- 
age of 11 /iV. With time constants of 10 to 30 seconds, 
light levels of 0.01 photons per pulse were detectable with 
an accuracy of approximately 1% and an uncertainty of 
approximately ±2.5%. 

Neutral density filters and rotation of the waveplate 
WPl, in conjunction with PBSl, attenuated the light en- 
tering the interferometer to an average of 0.1 photons per 
pulse. A pick-off beam was measured on PD6 using phase 
sensitive detection with a separate lock-in amplifier. This 
photodiode was used to normalize output amplitude vari- 
ations of the laser light when monitoring PDl-5 during 
all measurements and calibrations. These detectors, sup- 
plied in parallel by a single 9 V source, were calibrated 
relative to each other to better than 1% by changing the 
distribution of light around the interferometer. 

All the waveplates used in the experiments were housed 
in specially designed mounts. The waveplate was freely 
rotated to zero its position, measured optically, and 
locked in position. It could subsequently be moved by 
predefined fixed angles, including 17.63° and 27.37°, with 
an accuracy of ±0.05° and repeatability of ±0.01°. This 
enabled quick and accurate changes to the input polar- 
ization states in the trine and tetrad experiments. The 
waveplates were measured to maintain the linearity of 
the polarization to 1 part in 2000. 

The interferometer for the trine experiment was identi- 
cal to that described in |Q . It was constructed from four 
polarizing beamsplitters mounted on a machined mono- 
lithic aluminum block. In a conventional Mach-Zehnder 
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operation, the extinction ratio of the output of the in- 
terferometer on detectors PDl and PD2 was regularly 
measured to be 200:1. The system was stable enough to 
be left for over half an hour without significantly impair- 
ing this extinction ratio. 

For the tetrad experiment the first beamsplitter, 
NPBS, was non-polarizing with a nominally equal split- 
ting ratio at 780 nm. As the splitting ratio was highly 
wavelength dependent, the Ti: Sapphire laser was tuned 
to obtain the most equal splitting ratio, which was esti- 
mated to be 49.5 : 50.5. 



A. Trine Experiment 



100.0 



C° 83.3 



a 66.7 
Pi 

g 50.0 



B 33.3 



16.7 




|«A> l<A> l^3> 



The interferometer was aligned by using it as a con- 
ventional Mach-Zehnder interferometer, with high light 
powers (>> 1 photon per pulse). The input polarization 
was set to be linear at 45° to the horizontal and WP5 
was rotated to produce vertically polarized light. After 
alignment, the light entering the interferometer was at- 
tenuated to 0.1 photons per pulse and WP5 was rotated 
by 17.63° so that the polarization of the horizontal com- 
ponent transmitted by PBS2 was rotated anti-clockwise 
35.27° from the horizontal. This reduced the amplitude 
of the horizontal component to \/2/3 of its initial value. 
The 6 input trine and antitrine states were constructed 
by rotating waveplate WP2 in 15 degree intervals (WPS, 4 
are not present in this experiment) . The results were ob- 
tained by taking measurements for one detector (PDl-3) 
as the 6 input states were changed. Then the next de- 
tector was measured for the same 6 states. 

After the relative detector calibrations were taken into 
account, we normalized the results for each input state 
such that the total average measured probability on pho- 
todiodes PDl-3 summed to unity. The wl% total leak- 
age from PBS3 and PBS4 into detectors PD4 and PD5 
is not significant. For the purposes of mutual informa- 
tion it is the ratio of counts in photodiodes PDl-3 that is 
important, because it is these detectors that distinguish 
between the input states. The results are shown in Fig. 

i 

The dependence of the measurements on the angle of 
WP5 was then investigated. The experiment was re- 
peated as the angle of the WP5 was varied either side 
of the theoretical value of 17.63°. The alignment of the 
interferometer was checked and adjusted, if necessary, be- 
fore taking data at each separate angle. Figure |^ shows 
the root mean squared deviation of the 18 trine and an- 
titrine measurements from the ideal values for each angle. 

Before we discuss the results it is useful to review the 
sources of error. The largest is the noise of the detec- 
tor signal and from the phase sensitive detection pro- 
cessing. The observed uncertainty in the measurements 
of the output ports where a zero count rate is expected 
in the antitrine experiment is approximately 2.5%. This 
level of error is often similar to the average signal present. 
We therefore conclude that the zero count measurements 
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FIG. 5: Histogram showing the trine and antitrine results. 
Normalization ensures the sum of counts in detectors 1, 2 
and 3 is 100% for each input state. The theoretical ratios are 
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FIG. 6: The angle of WPS is varied around the theoretical 
value of a/2 — 17.63°. The graph shows the RMS deviation 
of the results from the optimum theoretical results. 



are limited by the detector noise if the measured signal is 
of order 2%. This has important implications for t he er- 
rors in the derived mutual information, see Section IV C . 
The noise is mainly attributable to the weakness of these 
signals in comparison to the magnitude of ground loop 
and stray light noise, and also variations in the chopping 
wheel frequency leading to varying offsets. We estimate 
that the error due to the amplitude normalization pro- 
cedure involving PD6 is less than 0.5%, which is much 
smaller than the TiiSapphire amplitude variations of up 
to 4% over a few minutes. 

A second large source of error is the non-ideal nature 
of the beamsplitters. The calibrated birefringence prop- 
erties are given in |l3]. It is sufficient here to state that 
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when purely horizontal light is input into PSB3, a power 
leakage of approximately 0.9% towards PBS4 was mea- 
sured. Even this small amount of reflected light can have 
large effects on the ratio of light reaching detectors PDl 
and PD2 due to interference effects. 

The drift of the interferometer during measurements 
was negligible compared to the above errors. This was 
evaluated by monitoring the level of destructive interfer- 
ence that could be observed on detectors PDl and PD2 
over several hours using high light intensities. 

The results in Fig. |g demonstrate that the trine and 
antitrine measurements are in close agreement with the- 
oretical predictions. The ratios § • g • g and ^ : ^ : 
are clearly visible for the respective measurements. The 
RMS deviation of the trine and antitrine results is 3.8% 
from the theoretically expected values. Of specific im- 
portance are the antitrine measurements which are the- 
oretically expected to be zero. The experimental mea- 
surements are indeed very close to zero, with an average 
value of 1.6%. Fig. ||also shows that the minimum RMS 
error from the optimum theoretical values was obtained 
when the waveplate WPS was at the angle corresponding 
to the theoretical value, within the limits imposed by the 
small number of data points. 

Close inspection of Fig. || demonstrates the effect 
of the leakage of PBS3. For antitrine input state 
the light on PDl and PD2 is split into the ratio 46:54. 
In an ideal experiment there would be no light travel- 
ing towards PBS4 from PBS2 and the split would be 
50:50. Indeed, this ratio is observed to better than ±0.2% 
if an opaque card is placed between PBS2 and PBS4. 
The leakage from PBS2, when interfering with the light 
from PBS3, is enough to skew the results by 8%. This 
demonstrates the sensitivity of the apparatus to practical 
sources of error. 



B. Tetrad Experiment 
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FIG. 7: Histogram showing the tetrad results, normalized so 
that the sum of the counts in detectors 1 to 4 is 100% for each 
input state. The theoretical ratios are ^ ■ ^ ■ '■ ^■ 
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FIG. 8: Histogram showing the antitetrad results, normalized 
so that the sum of the counts in detectors 1 to 4 is 100% for 
each input state. The theoretical ratios are 1:^:^:0. 



The first beamsplitter PBSl in the interferometer was 
changed to a non-polarizing beamsplitter. Alignment of 
the interferometer was achieved by simulating a conven- 
tional Mach-Zehnder operation with vertically polarized 
input light. Waveplate WP5 was aligned with the slow 
axis in the vertical direction and WP7, a quarter wave- 
plate, rotated so that there was complete mixing of the 
output states of the interferometer. 

After the alignment, waveplate WP5 was rotated 
clockwise by 54.74°/2 = 27.37°. This corresponds to set- 
ting the fast axis at 125.26°/2 = 54.747° anti-clockwise 
from the horizontal. The angle between two tetrad states 
is 125.26° on the Poincare sphere, or 54.74° in polariza- 
tion. 

The four tetrad and four antitetrad polarization states 
required to perform this experiment were constructed us- 
ing waveplates WP2-4 as described in Appendix We 
were able to change between the 8 input states in a mat- 
ter of seconds using the discrete predefined angles in our 



waveplate holders. 

The experiment was performed by measuring the sig- 
nal on detectors PDl-4 as the input states were changed 
between the 8 states. The results of the tetrad and an- 
titetrad measurements are shown in Figs. Q and ^ after 
calibration and normalization. 

As with the trine measurements, it is clear that the 
experimental results are in good agreement with the the- 
oretical predictions. The | • ^ ■ ^ • ^ and i : ^ : ^ : 
ratios for the tetrad and antitetrad states respectively 
are clearly evident. The overall RMS deviation from the 
theoretical values is 2.9%. Attention is drawn to the an- 
titetrad states where the count rate is theoretically zero. 
The average count rate in these four measurements is 
0.9%. This low count rate is very significant for the mu- 
tual information that can be obtained, see next subsec- 
tion. 
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C. Experimental Mutual Information 

The mutual information for both experiments was cal- 
culated in bits (log2) using Eq. (|l7|). The conditional 
probabilities were obtained from the experiment. The 
values for pk were set to | and j for the trine and tetrad 
cases respectively, corresponding to equal prior proba- 
bilities for the input states. The results, summarized in 
Table |, are in excellent agreement with the theoretical 
predictions using the model for detector noise described 
in Appendix The mutual information of both sets of 
antistates is significantly greater than the best possible 
von Neumann measurement. 

Experimental errors were estimated in a Monte Carlo 
simulation, assuming a flat ±2.5% error distribution in 
the measurements. The theoretical sensitivity of the mu- 
tual information to measurement error was estimated us- 
ing a simple error model, see Appendix |c[ The param- 
eter, r, was used to quantify the level of noise. The av- 
erage signal of the near-zero measurements in the trine 
and tetrad experiments, 1.6% and 0.9% respectively, was 
used to obtain the parameter F = 0.952 and 0.964 re- 
spectively. 

In the antitrine and antitetrad cases it was found that 
the sensitivity of the mutual information obtained is al- 
most entirely determined by the noise-induced counts at 
the detector that would, in an ideal experiment, register 
no counts. This is a consequence of the logarithmic na- 
ture of the mutual information. In an ideal experiment, 
a count in detector PDj means that state I'tpj') was def- 
initely not the input state. The knowledge of the input 
state has therefore been increased by a large amount. 
Any deviation from theoretically zero count rates in the 
experiment will lead to a rapid increase in the errors as- 
sociated with this deduction. Hence, the mutual infor- 
mation will decrease rapidly. Nevertheless, in our ex- 
periments, the mutual information obtained for both the 
antitrine and antitetrad states is still significantly higher 
than the best possible von Neumann measurement. To 
our knowledge this is first time this has been achieved 
experimentally. Measurement errors due to noise are the 
limiting factor determining the mutual information that 
can be achieved experimentally. The largest source of 
noise is due to ground loop noise in the detectors. The 
use of single photon detectors gated with the pulses of 
light should reduce this noise considerably and yield fur- 
ther increases in the mutual information obtained. 



V. CONCLUSIONS 

In quantum communications the possible signal states 
can be nonorthogonal. This leads to problems in re- 
covering the signal by detection. In general, we must 
accept the presence of errors. We have realized detec- 
tion strategies that either minimise the error probability 
or maximise the mutual information for the trine and 
tetrad states. The mutual information is an important 



parameter in communications, quantifying the maximum 
amount of information that can be extracted from a sig- 
nal. In the context of measuring the polarization of light, 
probability operator measures provide a means of attain- 
ing higher mutual information values than von Neuman 
measurements. We have presented two separate experi- 
ments using heavily attenuated pulses of light to verify 
the optimal POM on trine and tetrad polarization states 
of light. We have demonstrated for the first time that 
the mutual information obtainable using POMs can be 
significantly higher than the maximum possible mutual 
information in a von Neuman measurement. This illus- 
trates the principle that generalised measurements can be 
of greater utility than their more familiar von Neuman 
counterparts. 

Using a flexible free space interferometer we were able 
to vary all aspects of the measurement process determin- 
istically. The mutual information is extremely sensitive 
to the noise of the detectors in which very low counts 
rates are expected. We have demonstrated that practical 
communications using POMs are feasible. 
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APPENDIX A: MAKING AN ARBITRARY 
POLARIZATION STATE 

The convention used for the orientation of the wave- 
plates is as follows. Viewed in the direction of propaga- 
tion, the fast axis of the waveplate makes an angle 0/2 
with the horizontal in an anti-clockwise direction, and 
the slow axis similarly makes an angle (/)/2 with the ver- 
tical. Expressed in Jones matrix notation [^8| , the action 
of a half waveplate is thus 



A2(0/2) 



cos (p sm 
sin d) — cos 




(Al) 



where h and v are the amplitudes of horizontal and ver- 
tical polarization in the state respectively. Similarly, the 
action of a quarter waveplate with its fast axis at an angle 
9/2 to the horizontal is 

A,{e/2) ( ^ 1 



sin2(6l/2) -|-icos2(6'/2) 



cos2(6l/2) -|-isin^(6l/2) 




1 / cos — i sin 9 \ ( ^ 
y/2 \ sm9 — cos 9 — i J \v 



(A2) 
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States Experiment Ideal Theory Noisy Theory von Neumann 



trine 


'J-JL^ -O.OS 


0.333 


0.302 


0.459 


antitrine 


0.491 t^oll 


0.585 


0.486 


0.459 


tetrad 


0.209 IHlg 


0.208 


0.194 


0.311 


antitetrad 


nco +0.09 


0.415 


0.355 


0.311 



TABLE I: Mutual information expressed in bits for the trine and tetrad states and their antistates. The values indicate the 
mutual information obtained in the experiment, in an ideal experiment, in an ideal experiment with detector noise analyzed 
using the error model described in Appendix B, and the maximum allowed mutual information in a von Neumann measurement. 



WP2 WP3 WP4 



lh>. 



source 



output state 



e/2 (^/2 -e/2 

angle of axes of waveplate to vertical 

FIG. 9: Optical arrangement for making an arbitrary state. 
Waveplates WP2 and WP4 are quarter waveplates and WP3 
is a half waveplate. 



where the overall phase of e*'^'^/^ has been dropped. 

As can easily be verified using the above expressions, 
an arbitrary polarization state can be made from hori- 
zontally polarized input light using a sequence of three 
waveplates, quarter-half-quarter, at the appropriate an- 
gles, see Fig. ^. If the input state is \h) (horizontal 
polarization), and the orthogonal polarization is \v), the 
output state, {ip), is given by 



cos f3\h) + e^'^ sin/3 |w) 



(A3) 



where sm(j) = — sin/3 sin 7 and tan^ = tan/3 cos 7, with 
9, (j) specifying the orientation of the three waveplates as 
given in Fig.^. 



APPENDIX B: DETAILS OF TETRAD OPTICAL 
IMPLEMENTATION 

Starting from the input states given by Eq. (|), at the 
stage indicated by the line AA on Fig. H, we have 



I^i)a 

l^2>A 
I^3)a 



1 

1 

1 

1 
71 



(lV'f)L+*|V'2')u), 



(|^2>L 



(|V'3'>L + |V'I>U), 

(I^Dl + i^Du). 



In terms of \h) and 

+iV2e2"/3|^^^ 



-zV2e-2"/3|„y) 

IV'3')a = ^ (-l/iL) + V2|«l) + x/3|/iu)) , 

IVDa = ^ (%/3|/il) - \hv) + V2\vv)) ■ (B2) 

Then the components can reach photo-detector PD4 
and the \h\j) components can reach photo-detector PD3, 
while the \vl) and \vu) components continue round the 
lower and upper arms of the interferometer respectively. 
Applying WP6 to converting it to the states 

at the stage indicated by the line BB in Fig. |^ are 

-i\PD3) - \PD4:) ) , 

+t\PD3) - \PD4:) ) , 
|V'3')b = ^ {V2\h^) + V3\PD3) - \PDA)) , 
I^PDb = (\/2|wu) - |^^3) -f V3|Pi?4)) . (B3) 
Finally, the maximal mixing effect of WP9 brings the 



(Bl) 



states into 






H-th = 


7E 


'-iV3\h) - i\v) - i\PD3) ~ \PDA)^ , 


H4)c = 


7! 


- V3\v) + i\PD3) - \PD4)^ , 


l**)c = 




)h) + t\v) + v^\PD3) - \PDA)^ , 


H-t)c = 


7s 


'i\h) + \v) - \PD3) + V3\PD4)^ , (B4) 
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from which the final outcome is easily seen to be 



F — 



I*. 



2/F 



I*I)f 



1 



V6 



PD2) 



x/6 



1 

71 



Pi:>2) 



-7!'™^ + 7!'™^' ^^'^ 

where the amplitudes give the probabilities of each pho- 
todetector finding the photon. 




0.4 0.6 

Noise parameter (r) 



1.0 



FIG. 10: Tfie theoretical variation of the mutual information 
with the detector noise, given by parameter F, for the trine, 
antitrine, tetrad and antitetrad states. Included are read-off 
lines to show the theoretical values of the mutual informa- 
tion for the antitrine and antitetrad measurements, given the 
experimental values of F = 0.952 and 0.964 respectively. 



APPENDIX C: MUTUAL INFORMATION 
ERROR MODEL 

The mutual information obtained is very sensitive to 
noise. In this appendix we present a simple model for 
detector noise and use it to analyze our measurements of 
mutual information. Rather than attempt to account for 
all sources of noise, our model simply assumes that the 
noise randomly distributes a proportion of the detection 
events among all the detectors. 

The probability for detecting a photon in detector j 
(outcome yj) is associated with the POM element Ilj. 
Noise can cause a photocurrent to occur in a detector in 
the absence of any photons. Such noise-inducing events 
are randomly distributed and so we can modify our POM 
elements to account for these noisy events by replacing 
POM elements {Ilj} for an ideal measurement with 



1 



N 



(CI) 



Here N is the number of measurement outcomes and F 
is a positive number (F < 1) that characterizes the noise. 
For F = 1 there is no noise, but for F = 0, noise accounts 
for all detection events. The conditional probabilities in 
the presence of noise become 



(C2) 



Given a channel in which the input states occur 
with equal probability we find that the mutual informa- 
tion becomes 



/({IV'f)}: {y,}) 



1 



N 



1 - ^ ) log ( 1 



log(l + F) 
F 



TV- 1 



(C3) 



If, instead, the antistates \ipk) ^^e used as input, the 
mutual information is 



1 - - — ^\ log ( 1 T 



(C4) 



It is interesting to note that C3 and C4 differ in form 
only through the sign of F. 

We can estimate the value of F by considering the prob- 
ability for the measurement of the antistate j^/j^) to give 
the result yu, corresponding to "not-|'0^)". This is for- 
bidden for an ideal, noiseless experiment but will occur 
with probability {\ — T)/Nui the presence of noise. Com- 
paring this with the experimentally observed values pro- 
vides us with a value for F. 

Figure ^ shows the variation of the mutual informa- 
tion obtained as F is varied. The sensitivity to noise in 
the antitrine and antitetrad states increases markedly as 
F tends to 1. This demonstrates the sensitivity of the 
mutual information to noise in the detector which, in 
an ideal experiment, would register no counts. The mu- 
tual information of trine and tetrad measurements does 
not increase as rapidly because there are no detectors in 
which zero counts are expected. 
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